CDS

Accession Number TCMCG075C17541
gbkey CDS
Protein Id XP_017977094.1
Location join(34610484..34611728,34612062..34612265,34612688..34612792,34613273..34613338,34613421..34613585,34614843..34614917,34615018..34615169,34615537..34615672,34615757..34615826,34615919..34616046,34616170..34616226,34616580..34616765,34616877..34616969,34617180..34617265,34617350..34617416,34617500..34617566,34617666..34617802,34618706..34618800,34618924..34618990,34619081..34619245)
Gene LOC18599854
GeneID 18599854
Organism Theobroma cacao
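
As a quick consistency check on the Location field above, the sketch below sums the lengths of the 20 joined segments and compares the total with the protein length given in this record (1121 aa). It assumes inclusive, 1-based coordinates (the usual GenBank convention) and that the CDS includes a terminal stop codon. The helper name cds_length is illustrative and not part of any database tool.

```python
import re


def cds_length(location: str) -> int:
    """Sum the lengths of all start..end segments in a GenBank-style location.

    Assumes inclusive coordinates, so each segment spans end - start + 1 bases.
    """
    segments = re.findall(r"(\d+)\.\.(\d+)", location)
    return sum(int(end) - int(start) + 1 for start, end in segments)


# Location string copied from this record.
location = (
    "join(34610484..34611728,34612062..34612265,34612688..34612792,"
    "34613273..34613338,34613421..34613585,34614843..34614917,"
    "34615018..34615169,34615537..34615672,34615757..34615826,"
    "34615919..34616046,34616170..34616226,34616580..34616765,"
    "34616877..34616969,34617180..34617265,34617350..34617416,"
    "34617500..34617566,34617666..34617802,34618706..34618800,"
    "34618924..34618990,34619081..34619245)"
)

total_nt = cds_length(location)
print(total_nt)            # 3366
print(total_nt // 3 - 1)   # 1121 (codons minus the assumed stop codon)
```

The total of 3366 nt equals 3 x (1121 + 1), that is, 1121 codons plus one stop codon, which agrees with the stated protein length.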

Protein

Length 1121 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018121605.1
Definition PREDICTED: importin-5 isoform X1 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category UY
Description Importin-5-like
KEGG_TC 1.I.1
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko03009
KEGG_ko ko:K20222
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGCCACCGATCCAACTCAGCTTCAACTCGCCCAACTCGCCCACATCTTGGGCCCCGACCCGACCCACTTCGAAACCCTAATCTCCCACCTCATGTCTTCCTCCAACGACCAACGTTCCCAAGCTGAGTCACTCTTCCACCTCGCCAAACAGACTCAACCCGACTCACTCTCACTCGCTCTCTCCCGTGTCCTATCCTCCTGTTCCCGTCCCGAACTCCGTGCCCTCTCCGCTGTCCTTCTCCGCAAAATCCTAACTCCTGCCGCTGATTCCTCGTTTCTCTTCCCTCTCCTAGCTGAAACCACACGCGCCGCCATCAAATCCGCGCTTTTATCCACTCTCCAAACTGAACAATCCAAAACTAACGTCAAAAAACTCTGTGATACCATCTCTGAACTCGCCTCCTCCATCGTCGCTATTGGAGGTTGGCCGGAGCTATTACCGTTCCTCTTCCAATGCGTGAATTCTCCAAATCCGAATCTTCAAGAATCTGCTCTTTTGATCTTCTCTCGATTAGCTCAGAACATCGGAGAAACAACCGAAACCCTAATCCCTCATTTAAATACTCTCCATTCCGTTTTCTTCAAATGTTTATCGAATCCTTCAAGCTGCGATGTTCGAATTGCAGCTTTAAGCGCTTCAATCAGTTTTATTCAATGCATTTCAAATGGCAAAGATCGGGACACTTTTCAAGATTTATTGCCGTTGATGATGCAGACGTTGACCGAGGCGTTGAATTCGGGACTAGAAGCCACCGCTCAGGAGGCTTTGGAATTGTTAATTGAGTTAGCCGGGTCGGAGCCGAGGTTTCTAAGGAGGCAACTAATGGAAGTAGTTGGATCAATGTTGCAAATAGCGGAGGCGGAGAGTTTGGAAGAAGGGACACGCCATTTGGCAGTTGAGTTTGTTATTACGTTAGCAGAGGCGAGAGAGAGGGCTCCAGGGATGATGAGGAAGTTACCGCAGTTTATAAGGAGGTTATTTGGAGTGTTGATGAATATGTTGTTGGATGTTGAGGATGAACAGGATTGGTATAATGCCGAGAGTGAGGATGAGGATGCCGGGGAAACAAGTAATTATGCGGTTGGTCAGGAGTGTTTGGATAGATTGTCGATTTCTTTAGGAGGGAATACGGTTGTTCCAGTGGCTTCGGAGTTGTTTCCTGTGTTCTTGGCTGCTGCAGAGTGGCAGAAACGACATGCTGCTCTCATTGCACTTGCACAGATTGCTGAGGGTTGTTCTAAGGTGATGATAAAAAATCTAGAACAAGTAGTGTCAATGGTTTTGAATTCATTTCAGGATGCCCATCCTCGTGTTCGATGGGCAGCTATTAATGCCATTGGACAGTTGTCTACGGACTTGGGCCCAGAGTTGCAGTCACAATTTCATCATAAAGTTTTGCCTGCACTAGCTGGAGCCATGGATGACTTTCAAAATCCTCGCGTGCAGGCCCATGCTGCTTCAGCAGTCCTTAATTTCAGTGAAAATTGTACCCCAGACATCTTAACTCCATATCTGGATGGGATCGTGAGCAAACTTCTTGTACTTCTTCAGAATGGGAAGCAGATGGTACAGGAGGGTGCTTTAACAGCTTTGGCATCAGTTGCTGATTCATCTCAGGAGCAATTCCAAAAGTATTATGATGCTGTGATGCCTTACTTGAAAGCTATCTTGGTGAATGCAAATGATAAAGCTAATCGCATGCTTCGTGCCAAAGCTATGGAGTGCATTAGTTTGGTTGGAATGGCTGTTGGGAAGGACAAATTTAGGGATGACGCTAAACAGGTTATGGAAGTCTTGATGTCATTGCAAGGATCTCAAATGGAGTCGGATGATCCAACAACGAGCTACATGTTGCAAGCTTGGGCCAGACTTTGCAAGTGCCTTGGGCAGGATTTTCTTCCTTACATGAGTGTTGTCATGCCCCCTTTGCTTCAATCTGCTCAGCTTAAGCCTGATGTAACCATTACATCAGCTGATTCTGATGCTGATATTGATGA
TGATGACGAAAGCATTGAGACTATTACGCTTGGGGATAAAAGAATAGGGATTAAGACTAGTGTCTTGGAAGAAAAAGCTACGGCTTGCAACATGTTATGTTGTTATGCTGACGAGTTAAAGGAAGGATTCTTCCCATGGATCGATCAGGTAGCTACTACTTTAGTTCCCCTTCTAAAATTTTATTTCCATGAAGAAGTTCGGAAGGCAGCTGTTTCAGCCATGCCGGAGCTGTTAAGTTCAGCTAAGTTAGCTATTGAGAAGGGGCAATCTCAAGTTCGAAATGAAACATATGTAAAACAGTTAACTGATTACATAATACCAGCTTTGGTGGAGGCTTTACACAAGGAGCCTGAGGTAGAAATTTGTGCAAGCATGTTGGATTCCTTGAATGAATGCTTACAGGTTGCTGGACCATTTCTTGATGAGGGCCAAGTAAGGTGCATTGTCGATGAGATTAAACAGGTGATCACAGCTAGCTCAGCTAGAAAACAAGAAAGAGCAGAAAGGGCTAAAGCAGAGGACTTTGATGCGGAAGAAGGCGAAATGCTTGAGGAGGAAAATGAGCAAGAAGAAGAAGTTTTCGGTCAAGTTGGTGATTTGTTGGGCACGTTGATCAAAACATTCAAGGCATCCTTCTTACCTTTCTTCCAGGAGCTTACATCTTATGTAATGCCTATGTGGGGTAAGGATAAAACAGCAGAAGAAAGGAGAATCGCCATTTGTATTTTTGATGATGTTGCAGAGCATTGTCGTGAGGCAGCTCTTAAGTATTATGACACATATCTTCCTTTCGTATTGGAAGCTTGCAATGATGAAAATCCTGATGTTCGTCAGGCAGCAGTTTATGGGCTTGGTGTTTGTGCAGAGTTTGGTGGATCTGTATTCAAACCTCTTGTTCGAGAGGCCCTCTCGAGGCTGGATGCTGTTATTAGACATCCTAATGCATTGCATGCAGACAATGTGATGGCTTATGACAATGCTGTTTCAGCTCTTGGGAAAATATGCCAATTTCATCGGGATAGTATAGATGCAGCTCAGCAGATTGTTCCTGCTTGGTTAAGTTGCTTGCCCATAAAAGGTGATCTGATTGAGGCCAAGCTTGTTCATGATCAGCTATGTTCAATGGTTGAAAGGTCAGATCAGGAACTTTTAGGGCCCAACAATCAATATCTTCCCAAGATAGTAGCAGTTTTTGCAGAGGTTTTATGTGCCGGTAAAGATCTGGCAACGGAGCAAACTGCTAGTAGAATGATTAATCTGTTAAGGCATCTTCAGCAATCATTACCTCCATCTACACTAGCATCAACCTGGTCATCCCTGCAGCCACAGCAGCAGCTTGCTTTGCAATCAATTCTGTCATCTTAA
Protein:  
MATDPTQLQLAQLAHILGPDPTHFETLISHLMSSSNDQRSQAESLFHLAKQTQPDSLSLALSRVLSSCSRPELRALSAVLLRKILTPAADSSFLFPLLAETTRAAIKSALLSTLQTEQSKTNVKKLCDTISELASSIVAIGGWPELLPFLFQCVNSPNPNLQESALLIFSRLAQNIGETTETLIPHLNTLHSVFFKCLSNPSSCDVRIAALSASISFIQCISNGKDRDTFQDLLPLMMQTLTEALNSGLEATAQEALELLIELAGSEPRFLRRQLMEVVGSMLQIAEAESLEEGTRHLAVEFVITLAEARERAPGMMRKLPQFIRRLFGVLMNMLLDVEDEQDWYNAESEDEDAGETSNYAVGQECLDRLSISLGGNTVVPVASELFPVFLAAAEWQKRHAALIALAQIAEGCSKVMIKNLEQVVSMVLNSFQDAHPRVRWAAINAIGQLSTDLGPELQSQFHHKVLPALAGAMDDFQNPRVQAHAASAVLNFSENCTPDILTPYLDGIVSKLLVLLQNGKQMVQEGALTALASVADSSQEQFQKYYDAVMPYLKAILVNANDKANRMLRAKAMECISLVGMAVGKDKFRDDAKQVMEVLMSLQGSQMESDDPTTSYMLQAWARLCKCLGQDFLPYMSVVMPPLLQSAQLKPDVTITSADSDADIDDDDESIETITLGDKRIGIKTSVLEEKATACNMLCCYADELKEGFFPWIDQVATTLVPLLKFYFHEEVRKAAVSAMPELLSSAKLAIEKGQSQVRNETYVKQLTDYIIPALVEALHKEPEVEICASMLDSLNECLQVAGPFLDEGQVRCIVDEIKQVITASSARKQERAERAKAEDFDAEEGEMLEEENEQEEEVFGQVGDLLGTLIKTFKASFLPFFQELTSYVMPMWGKDKTAEERRIAICIFDDVAEHCREAALKYYDTYLPFVLEACNDENPDVRQAAVYGLGVCAEFGGSVFKPLVREALSRLDAVIRHPNALHADNVMAYDNAVSALGKICQFHRDSIDAAQQIVPAWLSCLPIKGDLIEAKLVHDQLCSMVERSDQELLGPNNQYLPKIVAVFAEVLCAGKDLATEQTASRMINLLRHLQQSLPPSTLASTWSSLQPQQQLALQSILSS